Fix TensorRT-LLM #7142

mc-nv · 2024-04-20T02:59:17Z

No description provided.

Tabrizian

@krishung5 might be more familiar with these changes.

Tabrizian · 2024-04-20T20:20:40Z

build.py

@@ -1708,6 +1708,30 @@ def tensorrtllm_prebuild(cmake_script):
    # Export the TRT_ROOT environment variable
    cmake_script.cmd("export TRT_ROOT=/usr/local/tensorrt")
    cmake_script.cmd("export ARCH=$(uname -m)")
+    cmake_script.cmd(
+        'export LD_LIBRARY_PATH="/usr/local/cuda/compat/lib.real:${LD_LIBRARY_PATH}"'


Why is this line required?

Because libcuda.so.1 can't be resolved until --runtime=nvidia is passed.
It's a workaround that lets us refer to the links in the build container; it can be deprecated in the future.

You can create a task and assign it to me.
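For illustration only, here is a minimal sketch of what the workaround does: it places the CUDA compat directory first on LD_LIBRARY_PATH so the dynamic loader can find libcuda.so.1 in the build container. The helper names `compat_export_line` and `prepend_library_path` are hypothetical and are not part of build.py; only the directory path and the export string come from the diff above.

```python
# Hypothetical helpers, not part of build.py. They only illustrate the
# effect of the export line added in the diff.

# Path taken from the diff; whether it exists depends on the container image.
COMPAT_DIR = "/usr/local/cuda/compat/lib.real"


def compat_export_line(compat_dir=COMPAT_DIR):
    """Return a shell export statement that puts compat_dir first."""
    # Doubled braces produce a literal ${LD_LIBRARY_PATH} for the shell.
    return 'export LD_LIBRARY_PATH="{}:${{LD_LIBRARY_PATH}}"'.format(compat_dir)


def prepend_library_path(env, compat_dir=COMPAT_DIR):
    """Return a copy of env with compat_dir first on LD_LIBRARY_PATH."""
    updated = dict(env)
    current = updated.get("LD_LIBRARY_PATH", "")
    # Drop empty entries and any earlier copy of compat_dir to avoid duplicates.
    parts = [p for p in current.split(":") if p and p != compat_dir]
    updated["LD_LIBRARY_PATH"] = ":".join([compat_dir] + parts)
    return updated
```

The first helper reproduces the string passed to `cmake_script.cmd` in the diff; the second applies the same ordering to an environment mapping, which could be handy if the build step were launched from Python rather than from the generated script.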

* Fix TensorRT-LLM (#7142)
  * TRT-LLM build
  * Update versions
  * Remove statment, as unused
  * Remove cache
  * add cmake option to set CXX11 ABI
  * Mchornyi krish 24.04 (#7149)
    * Enable TensorRT-LLM build outside of CMake
    * TensorRT-LLM requires lower version of cuDNN
  * Format

  Co-authored-by: krishung5 <[email protected]>
* Update README and versions for 2.45.0 / 24.04 (#7096)
  * Update README and versions for 2.45.0 / 24.04
  * Update ONNX Runtime version - 1.17.3

  Co-authored-by: krishung5 <[email protected]>

mc-nv requested review from Tabrizian, krishung5 and nvda-mesharma April 20, 2024 02:59

mc-nv changed the title ~~Krish 24.04~~ Fix TensorRT-LLM Apr 20, 2024

Tabrizian previously approved these changes Apr 20, 2024

krishung5 reviewed Apr 22, 2024

krishung5 reviewed Apr 22, 2024

mc-nv dismissed Tabrizian’s stale review via f364766 April 22, 2024 17:35

mc-nv requested review from krishung5 and Tabrizian April 23, 2024 02:02

krishung5 and others added 7 commits April 22, 2024 19:08

TRT-LLM build

2789103

Update versions

fd69a67

Remove statment, as unused

51bee39

Remove cache

36baf3e

add cmake option to set CXX11 ABI

4f6ad97

Mchornyi krish 24.04 (#7149)

35595d9

* Enable TensorRT-LLM build outside of CMake * TensorRT-LLM requires lower version of cuDNN

Format

37800d0

krishung5 force-pushed the krish-24.04 branch from f337af3 to 37800d0 Compare April 23, 2024 02:09

krishung5 approved these changes Apr 23, 2024

mc-nv merged commit bdab292 into r24.04 Apr 23, 2024
3 checks passed

mc-nv deleted the krish-24.04 branch April 23, 2024 02:20
